Speech formant frequency and bandwidth tracking using multiband energy demodulation

نویسندگان

  • Alexandros Potamianos
  • Petros Maragos
چکیده

In this paper, the amplitude and frequency AM–FM modulation model and a multiband demodulation analysis scheme are applied to formant frequency and bandwidth tracking of speech signals. Filtering by a bank of Gabor bandpass filters is performed to isolate each speech resonance in the signal. Next, the amplitude envelope AM and instantaneous frequency FM are estimated for each band using the energy separation algorithm ESA . Short-time formant frequency and bandwidth estimates are obtained from the instantaneous amplitude and frequency signals; two frequency estimates are proposed and their relative merits are discussed. The short-time estimates are used to compute the formant locations and bandwidths. Performance and computational issues of the algorithm are discussed. Overall, multiband demodulation analysis MDA is shown to be a useful tool for extracting information from the speech resonances in the time–frequency plane. © 1996 Acoustical Society of America.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

A new multicomponent AM-FM demodulation with predicting frequency boundaries and its application to formant estimation

In this paper, a method using dynamic programming to predict frequency boundaries is proposed for the joint demodulation of amplitude modulation (AM) and frequency modulation (FM) for speech signals. Because of the existence of modulations in speech signal, an algorithm called energy separation algorithm (ESA) has been developed to track the energy needed by a source to produce the speech signa...

متن کامل

Instantaneous Energy Operators : Applications To

The nonlinear energy operator (x) _ x] 2 ? x x and its discrete-time counterpart have found numerous applications including development of the energy separation algorithm (ESA) for demodulat-ing AM-FM signals, tracking speech modulations, and detecting various events in nonstationary signals. In this paper we rst present some improvements on the energy operator and ESA when applied to demodulat...

متن کامل

A multimodal density function estimation approach to formant tracking

We address the problem of robust formant tracking in continuous speech. We propose the robust statistical model of t-distribution mixture density (tMM) operating on the “pyknogram” obtained through a multiband AM-FM demodulation technique. The statistical model of the pyknogram is shown to be more-effective to handle the variability in the signal processing stage. The t-mixture density estimati...

متن کامل

Speech analysis and synthesis using an AM-FM modulation model

In this paper, the AM{FM modulation model is applied to speech analysis, synthesis and coding. The multiband demodulation pitch tracking algorithm is proposed that produces smooth and accurate fundamental frequency contours. The AM{ FM modulation vocoder represents speech as the sum of resonance signals modeled by their amplitude envelope and instantaneous frequency signals. E cient modeling an...

متن کامل

Tracking Formant Trajectory of Continuous Chinese Whispered Speech with Hidden Dynamic Model Based on Dynamic Target Orientation

Aimed at the characteristics of Chinese whispered speech formants, i.e., migrating to highfrequency, increased bandwidth, and increased spurious peaks and merged peaks, a method of tracking the formant trajectory of continuous Chinese whispered speech using the Hidden Dynamic Model (HDM) with dynamic target orientation was put forward in this study. The calculation proceeded as follows: firstly...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 1995